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Abstract — This paper presents a model of NAND flash SSD utilization and write amplification when the ATA/ATAPI SSD Trim 
command is incorporated into object-based storage under a variety of user workloads, including a uniform random workload 
with objects of fixed size and a uniform random workload with objects of varying sizes. We first summarize the existing models 
for write amplification in SSDs for workloads with and without the Trim command, then propose an alteration of the models that 
utilizes a framework of object-based storage. The utilization of objects and pages in the SSD is derived, with the analytic results 
compared to simulation. Finally the effect of objects on write amplification and its computation is discussed along with a potential 
application to optimization of SSD usage through object storage metadata servers that allocate object classes of distinct object 
size. 

Index Terms — Trim, Solid State Drive, Object-Based Storage, Write Amplification, NAND Flash, Markov processes. 



> 

in 
in 



X 



1 Introduction 

IN recent years, NAND flash solid state drives 
(SSDs) have become ubiquitous, appearing in small 
imobile electronics to large data centers. Part of the 
popularity of flash-based SSDs over traditional hard 
disk drives (HDDs) is the increased speed and per- 
formance that they offer. However, SSDs have their 
own set of problems, including a slowdown in per- 
formance as the device fills with data due to write 
amplification, and a limited lifetime due to wearout 
of the floating-gate transistors. Additionally, SSDs 
currently need to conform with legacy interfaces from 
HDDs, which makes optimization in using the device 
more challenging. 

Object based storage offers multiple advantages for 
systems using solid state storage devices [1]. When the 
multiple thousand pages in a flash block are divided 
into object records of larger size, the resultant fewer 
objects per block directly translates into fewer data 
write relocations during garbage collection, leading to 
a lower write amplification and faster performance. 
This offers an opportunity for user application stor- 
age metadata, via metadata servers, to actively allow 
flash storage devices to self-optimize their hardware, 
to fully utilize SSD write speed advantages over 
magnetic HDD while ameliorating SSD limitations 
like block erase and finite write lifetime. Object stor- 
age metadata servers could allocate SSDs to object 
classes 12J of distinct object size and thereby allow 
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SSD to internally manage their object physical map- 
ping and relocation. This has the potential to improve 
I/O rate and QoS, user access blocking probability, 
and SSD flash chip lifetime through the reduction of 
write amplification. 

In this paper, we present a model for computing uti- 
lization and write amplification under a modified ran- 
dom workload including Write and Trim commands, 
and data records of varying sizes. This model is an 
extension of models proposed for simpler workloads 
by previous researchers |3l, fH, [|5|, |6| and builds on 
our previous models of the Trim command [7J, |8|, |9|. 
Good models assist in the development of algorithms 
for managing the challenges of SSDs; our model adds 
flexibility to the theory in order to more closely match 
real-world workload characteristics. 

2 NAND Flash SSDs 

In this section, we provide background about NAND 
Flash SSDs which is necessary to understand the 
model presented in this paper. 

2.1 Layout 

The organization of a NAND flash SSD gives it char- 
acteristics that cause unique challenges. The smallest 
storage unit of flash is the page, which typically holds 
2K, 4K, or 8K of data [TO'I, flT|. Pages are grouped into 
blocks, with blocks typically containing 64, 128, or 256 
pages, varying by manufacturer ITOji . One challenge of 
flash SSDs is that a page must be erased before it can 
be programmed again, but the smallest erase unit of 
flash SSDs is the block, and not all pages on the block 
are ready to be erased at the same time. This means 
that write-in-place schemes are impractical, and most 
SSDs employ a dynamic logical-to-physical mapping 



2 



called the flash translation layer (FTL) Qa, With 
this dynamic mapping, when data is overwritten, 
the old physical location of the data is marked as 
invalidated in the FTL mapping. In this paper, we 
utilize a log-structured file system, in which data is 
written to successively available pages, as described 
in 0. 

2.2 Garbage Collection 

Garbage collection is the process of selecting and 
erasing a block to reclaim space for writing. When 
a block is chosen for erasure, any valid pages on 
the block first need to be copied elsewhere to avoid 
data loss. Copying these valid pages leads to write 
amplification, a phenomenon in which the device 
performs more writes than are requested of it. 

There are many algorithms used to select a block for 
garbage collection, most of which are based on one of 
two schemes originally described in the seminal paper 
by Rosenblum and Ousterhout [14J. Their first method 
of selection chooses the block that creates the most 
free space, and their second method also accounts for 
the age of the block. In this paper, we utilize greedy 
garbage collection, in which the block with the fewest 
valid pages is selected. 

2.3 Improving Performance 

Reducing write amplification creates an improvement 
in performance by increasing the average write speed 
due to the fewer number of extra writes performed by 
the device. Additionally, by reducing the number of 
program / erase cycles, SSD lifetime is extended, since 
flash devices wear out and lose the ability to store 
data after being written many times. 

There are several ways to reduce write amplifi- 
cation. One common way is for manufacturers to 
provide extra storage space on the SSD beyond what 
the user is allowed to access, a practice called over- 
provisioning. Overprovisioning reduces write ampli- 
fication because the amount of time between garbage 
collections increases, which tends to lead to a decrease 
in the number of valid pages in the block selected for 
reclamation. We measure the amount of overprovi- 
sioning with the spare factor, Sf = {T — u) /T, < 
Sf < 1, where T is the total number of pages on the 
SSD, and u is the number of pages in the user space. 

Another way to reduce write amplification is 
through use of the Trim command fl5|, which is a 
way for the file system to communicate to the SSD 
that certain data pages no longer need to be retained. 
The pages associated with the data are invalidated in 
the FTL mapping, meaning that there are fewer valid 
pages that need copying during garbage collection. 

3 Analyzing Performance 

Several researchers have created analytic models to 
help understand the performance of SSDs. In this 



section, we summarize previous work that we build 
on in this paper. 

3.1 Uniform Random Workload 

The type of workload used can have a large effect 
on the performance of a storage device ||T6ll . We 
view real-world workloads as existing along a con- 
tinuum from purely sequential workloads to com- 
pletely random workloads. For SSDs, analysis of a 
purely sequential workload is uninteresting, because 
entire blocks of data are invalidated before garbage 
collection, leading to no write amplification |3|. How- 
ever, at the other end of the spectrum, completely 
random workloads are very interesting to statistically 
analyze, since the random nature of the writes results 
in most blocks still containing valid pages at the 
time of garbage collection, leading to potentially high 
write amplification. Hu, et al. lH, HI, Agarwal and 
Marrow [5J, and Xiang and Kurkoski [6] have all pro- 
posed models for write amplification of the uniform 
random workload under greedy garbage collection. 
Their workloads consist of write requests distributed 
uniformly randomly over the user space, with each 
logical block address (LBA) requested requiring one 
page to store the data. Each of their models found that 
the single most important factor in determining the 
write amplification was the level of overprovisioning 
provided on the SSD. 

3.2 Trim-Modified Uniform Random Workload 

Previously, we modified the uniform random work- 
load to incorporate Trim requests, then analyzed the 
resulting utilization and write amplification |7|, |8|, 
[9J. In the Trim-modified uniform random workload, 
both write and Trim requests are allowed, and the 
data associated with each LBA occupies one page. 
As in the standard uniform random workload, write 
requests are uniformly randomly chosen over all u 
user LB As, and Trim requests are uniformly randomly 
chosen over all In-Use LB As, where an LBA is defined 
as being In-Use if its most recent request was a Write. 
Trim requests occur with probability q, and write 
requests occur with probability 1 — q. This workload 
was modeled as a Markov birth-death chain with the 
state being the number of In-Use LBAs, and the steady 
state solution was approximated as Gaussian with 
mean us and variance us, where s = (1 — 2q)/{l — q) 
and s = q/{l — q) are dependent on the rate of Trim. 

3.2. 1 Effective Overprovisioning 

Since the number of In-Use LBAs is the same as the 
number of valid pages, the level of effective overpro- 
visioning is straightforward to calculate. The effective 
overprovisioning is measured in terms of the effective 
spare factor, S'eff = (T — Xn)/T, where Xn is the 
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number of valid pages at time n. The mean effective 
spare factor was found to be 

- T-us 

^ eff — 

= s-\-sSf 

The variance of the effective spare factor was also 
calculated, and found to be negligible in practical 
devices (as T oo) [7], \9j. 

3.2.2 Write Amplification 

We also explored write amplification under the Trim- 
modified random workload. After paralleling the 
derivation of Xiang and Kurkoski |6|, we discovered 
that all three previously proposed models for write 
amplification could be modified by using the effective 
level of overprovisioning. In simulation, the Trim- 
modified model of Xiang and Kurkoski provided the 
best prediction; the Trim-modified models of Hu, et 
al., and of Agarwal and Marrow were optimistic in 
their predictions of write amplification |8|, [9]. 

Additionally, we separated the workload into hot 
(frequently accessed) and cold (infrequently accessed) 
data in order to closer approximate a real-work work- 
load. Analysis of this workload led to a more general 
workload with N temperatures, and found that the 
write amplification of this workload type was lowest 
when the N temperatures were kept physically sepa- 
rated into different blocks HI. 

4 Model Augmentation 

In the previous work described in Section |3j one 
page stored the data associated with one LBA. In 
this section, we augment the model to allow larger 
amounts of data to be stored under a single identifier, 
which lends itself naturally to a discussion of object- 
based storage. 

4.1 From LBAs to Objects 

In object-based storage, the filesystem does not assign 
data to particular LBAs. Instead, the filesystem tells 
the object-based storage disk to store certain data; the 
disk chooses where to store it, and gives the filesys- 
tem a unique object ID with which the data can be 
retrieved and/or modified [17]. Object-based storage 
allows for variable-sized objects with identifiers that 
need not indicate any information about the physical 
location of the data [18]. Rajimwale, et al. have pro- 
posed that object-based storage is an optimal interface 
for flash-based SSDs [19J. We use object-based storage 
as motivation for extensions to the theory of effective 
overprovisioning and write amplification described in 
Section |3] 

The utilization problem we solved previously |7J 
can be viewed in object space instead of LBA space. 
Instead of LBAs that are In-Use or not In-Use, there 



are object IDs that are In-Use or not In-Use. Previ- 
ously, there were u LBAs allowed; now we allow 
object IDs in the range [l,u], for a maximum of u 
objects. The object-view of the Trim-modified uniform 
random workload is the same as the LBA view of the 
problem when each object occupies 1 page. 

To remove the restriction that one object occupies 
only one page, we start by examining the workload 
in which each object occupies b pages, where b is 
constant and identical for all objects in the workload. 
Then, we examine the workload in which b varies 
each time an object is written. For all workloads, 
we assume that when an object ID is selected for 
trimming (or, more appropriately for objects, dele- 
tion), all pages associated with that object ID are 
invalidated. Likewise, when an object is rewritten, all 
pages previously containing data associated with that 
object ID are invalidated. Note that there are now two 
quantities of interest: the number of object IDs that are 
in-use, and the number of pages that are valid. When 
each object occupies one page, these two quantities 
are the same. 

Define the following variables: 

Xt = {xi,t,X2,t, ■ ■ ■ ,Xu,tV is a vector of indicator 
random variables at time t. 

^ ^ ^ 1 1 Lj e A.i (object i is In-Use) 

1 otherwise (object i is not In-Use) 

where 

Ai — {Lj|the STATE ((x)) is such that location i is occupied} 

-^t = X]r=i Xi-t is a random variable representing 
the number of objects in-use at time t. The mean and 
variance of this random variable at steady state were 
computed previously by Frankie, et al. IZl, El. 

Zt = (^i,t, Z2.t, • ■ . , Zu,t)^ is a vector of the size of 
each object at time t. 

Now, compute the number of valid pages at time t: 

— Xt = Yll=i Xi.tZi^t 

We are concerned with the values at steady state, 
and drop the time index to indicate the steady state 
solution. 

4.2 Fixed Sized Objects 

This workload is an extension of the original work- 
load whose steady state pdf was derived in [7J. How- 
ever, objects now occupy b pages instead of 1 page. 
Recall that the number of In-Use objects is the same as 
the number of In-Use LBAs previously derived; only 
the number of valid pages will change. We assume 
that at least ub pages are allowed for the user to write 
to, so that it is not possible to request more space than 
is available. 
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For this workload, Zt ~ {b,b, . . . , b)'^ Vt. This means 

u 

i=l 
u 

= I]XM& 

u 

= b^Xi.t 

i=l 

= bXt 

To compute the steady state mean and variance of Y , 
we can use the known steady state mean and variance 
of X. 

E\Y] = E[hX] 
= bE[X] 
— bus 



Similarly, 

Var[Y\ = Var[bX] 

= b^VarlX] 

= b^us 

4.3 Variable Sized Objects 

In this workload, the size of each object varies in- 
dependently over time. Each time an object ID is 
selected to be written, the size of the object is chosen 
randomly, according to the distribution from which 
Zi_t is drawn, regardless of any size that object ID may 
have had in the past. t is drawn from a distribution 
with a minimum value Bi > 1, a maximum value 
B2 > Bi, a mean value mz, and a standard deviation 
(Tz- Again, we assume there is sufficient space for 
writing all requests; this means there are at least 
UB2 pages allowed for the user Because Zij is not 
constant, Yt — J2^=i Xi,tZi^t cannot be simplified, as it 
was in the fixed-sized object case. 

The mean is still relatively straightforward to cal- 
culate, since Xi independent, and the ZiS 
are iid: 



E[Y] = E 



u 

= Y.E[xr]E[Z., 

u 

= ^E[xt]mz 
1=1 

u 

= mz^E[x^] 

1=1 



mzE 



.2=1 



= mzE[X] 
= mzus 



(1) 



The variance is more complicated because of cross- 
terms. The Xi values are not independent, so it is not 
simple to compute the varianc^ 

To compute the variance of Y , compute 

Var{Y) = ElY"^] - E[Yf 

The value of E\Y] was computed in Equation [T] Com- 
puting is more complicated. When following 
the steps below, note that xi — Xi ' Xi is independent 
of Zi and Z.j, and the Zi values are iid. 

^ u \ 2 

E\Y-'] = E \Y.X^Z^] 



E 



^XiZf + 2'Y^ XiX^,z^ZJ 

Y,E[x,Z}]+2Y,E[x^XiZa,] 



Y.E[x.]E[Z}] 
+2Y,E[x^X^E[Z,]E[Z,] 



i<3 



Most of these expected values are straightforward 
to calculate. However, care must be taken when 
computing E[xiXj]r because the Xi values are NOT 
independent. See Section 4.3.1 for details. 
Continuing, 



i<J 



{u - l)u s (us - 1 + j) 



= a%us + rn^yus + m^us [us — 1 



u 
s 



J Z I tiLz'^^ I '"'Z 



= azus + (nizus) + nizUS 

Now it is possible to compute 

Var{Y) = E[Y'^] - E[Y]^ 

— a%us + (jnzusY + m\us — (rnzus)'^ 
~ azUS + ni\us 



4.3.1 Computing E [xi Xj ] 

As mentioned previously, care must be taken when 
computing E[xiXj]f because the Xi values are NOT 
independent. 

E[x^XJ] - PiAnA,) 

= P{A,\A,)P{A,) 

1. If the Xi values were independent, X would have a binomial 
distribution, but because we know from previous analysis that the 
variance is us and not uss, this is obviously not the case. 
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Obviously, when i = j, P{Ai\Aj) = 1. However, 
calculating this value when i ^ j is important. Since 
there is nothing special about any particular i or j, 
assume that P{Ai\Aj) is the same for all i ^ j, and 
thus is a constant with respect to i and j. 

It is tempting to claim that P{Ai\Aj) — (us — — 
1), since us is the expected number of In-Use objects 
at steady state. However, this is an invalid statement 
because it implicitly assumes a zero variance. In fact, 
using this value in the calculation of Var{Y) results 
in a statement that Var{Y) = 0, which is exactly 
what is expected because of the implicit zero variance 
assumption. 

Instead of trying to compute P{Ai\Aj) directly, it is 
possible to compute it by using the result of the one 
object occupies one page model. Working backwards 
to calculate this probability, and noting that there are 
terms in the sum over 



X 10' 



■Comparison of Analytic and Simulation pdf 



I < j: 



Var{Y) 



U 

Var (x.) + 2 ^ Cov{x^,XJ) 



i<j 



US 



Y^ss + 2Y^C 

i=l i<j 

uss + 2u{u — 1)C 



_2 

Solving for C yields which is the value of the 
covariance|^ Continuing with the calculations. 



Cov{xt,Xj) 



c 

E[X^X^-P{A^)P{A,) 

E[X^X^ - 



so that 



EiXiXj] 



C + s^ 

-2 



U- 1 

s (us — 1 



u-l 



(2) 



4.4 Simulation Results 



Monte-Carlo simulation was used to verify the an- 
alytic results, with various values of u, q, and dis- 
tributions of Z. In this paper, results are shown for 
u = 1000. Simulations were run for 1 million requests 
to ensure steady state was reached, then the simu- 
lation means and variances were averaged over the 

2. Interestingly, as u gets large, the covariance goes to 0, indi- 
cating that the Xi values are effectively independent for large u. 
However, the rate at which the covariance goes to is counter- 
acted by the rate at which the number of terms in the sum over 
i < j increases, resulting in a variance that is NOT binomially 
distributed. 
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Fig. 1 . Comparison of analytic and simulation pdf for 
the number of valid pages when objects are a fixed size 
of 32 pages. For this example, u = 1000 and q = 0.2. 



next 9 million requests. The results in each table are 
the average of 64 Monte-Carlo simulations. Although 
results are only shown for u = 1000, results for larger 
values of u are comparable. 

Table [T| shows the analytic and simulation values 
of the mean and standard deviation of the number 
of In-Use objects as well as the mean and standard 
deviation of the number of valid pages at steady state 
when the objects are all a fixed size of 32 pages. When 
rounded to the nearest integer, the mean number 
of objects is predicted exactly by analysis; the mean 
number of valid pages is predicted to within 3 pages. 
The slight error in the prediction of the number of 
valid pages is likely due to the extremely large vari- 
ances seen when q > 0.3. An overlay of the simulation 
pdf and the analytic pdf for the number of valid pages 
is shown in Fig. [l] for q = 0.2. 

Table |2] shows the analytic and simulation values 
of the mean and standard deviation of the number 
of In-Use objects as well as the mean and standard 
deviation of the number of valid pages at steady 
state when the size of the objects is drawn from a 
discrete uniform distribution over [1, 32]. Analytic and 
simulation values match exactly when rounded to the 
nearest integer. An overlay of the simulation pdf and 
the analytic pdf for the number of valid pages is 
shown in Fig. |2]for q = 0.2. 

Table |3] shows the analytic and simulation values 
of the mean and standard deviation of the number 
of In-Use objects as well as the mean and standard 
deviation of the number of valid pages at steady state 
when the size of the objects is drawn from a binomial 
distribution with parameters p = 0.4 and n = 32. 
Analytic and simulation values match exactly when 
rounded to the nearest integer. An overlay of the 
simulation pdf and the analytic pdf for the number 
of valid pages is shown in Fig. |3] for q = 0.2. 
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TABLE 1 

Mean and a of Number of In-Use Objects and Number of Valid Pages for Fixed Sized Objects. Values in this 
table were computed with u = 1000. The size of the objects was fixed at 32. Simulation values are the average 

of 64 Monte-Carlo simulations. 



Trim Amoimt q 


Mean Objects 


CT Objects 


Mean Pages 


cr Pages 


Analytic 


Simulation 


Analytic 


Simulation 


Analytic 


Simulation 


Analytic 


Simulation 


0.05 


947.37 


947.37 


7.25 


7.24 


30315.79 


30315.80 


232.15 


231.80 


0.1 


888.89 


888.89 


10.54 


10.54 


28444.44 


28444.37 


337.31 


337.37 


0.2 


750.00 


750.01 


15.81 


15.82 


24000.00 


24000.37 


505.96 


506.18 


0.3 


571.43 


571.47 


20.70 


20.73 


18285.71 


18287.10 


662.46 


663.29 


0.4 


333.33 


333.43 


25.82 


25.83 


10666.67 


10669.70 


826.24 


826.46 


0.45 


181.82 


181.89 


28.60 


28.62 


5818.18 


5820.40 


915.32 


915.93 



TABLE 2 

Mean and a of Number of In-Use Objects and Number of Valid Pages for Variable Sized Objects. Values in this 
table were computed with u = 1000. The size of the objects was drawn from a discrete uniform distribution over 
[1, 32]. Simulation values are the average of 64 Monte-Carlo simulations. 



Trim Amoimt q 


Mean Objects 


cr Objects 


Mean Pages 


a Pages 


Analytic 


Simulation 


Analytic 


Simulation 


Analytic 


Simulation 


Analytic 


Simulation 


0.05 


947.37 


947.36 


7.25 


7.27 


15631.58 


15631.33 


308.37 


307.92 


0.1 


888.89 


888.92 


10.54 


10.53 


14666.67 


14667.03 


325.62 


325.63 


0.2 


750.00 


750.00 


15.81 


15.80 


12375.00 


12374.67 


363.32 


363.42 


0.3 


571.43 


571.41 


20.70 


20.70 


9428.57 


9429.00 


406.69 


406.49 


0.4 


333.33 


333.35 


25.82 


25.79 


5500.00 


5500.13 


458.17 


457.71 


0.45 


181.82 


181.81 


28.60 


28.58 


3000.00 


2999.85 


488.11 


487.65 



TABLE 3 

Mean and a of Number of In-Use Objects and Number of Valid Pages for Variable Sized Objects. Values in this 
table were computed with u = 1000. The size of the objects was drawn from a binomial distribution over with 
parameters p = 0.4 and n = 32. Simulation values are the average of 64 Monte-Carlo simulations. 



Trim Amoimt q 


Mean Objects 


a Objects 


Mean Pages 


a Pages 


Analytic 


Simulation 


Analytic 


Simulation 


Analytic 


Simulation 


Analytic 


Simulation 


0.05 


947.37 


947.36 


7.25 


7.27 


12126.32 


12126.39 


126.09 


126.06 


0.1 


888.89 


888.88 


10.54 


10.53 


11377.78 


11377.73 


158.21 


158.10 


0.2 


750.00 


750.01 


15.81 


15.80 


9600.00 


9600.25 


216.15 


216.00 


0.3 


571.43 


571.37 


20.70 


20.68 


7314.29 


7313.46 


273.14 


272.77 


0.4 


333.33 


333.32 


25.82 


25.79 


4266.67 


4266.57 


334.35 


334.03 


0.45 


181.82 


181.84 


28.60 


28.63 


2327.27 


2327.60 


368.03 


368.36 
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Fig. 2. Comparison of analytic and simulation pdf for 
the number of valid pages when the size of objects is 
drawn from a discrete uniform distribution over [1,32]. 
For this example, u = 1000 and q = 0.2. 
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Fig. 3. Comparison of analytic and simulation pdf for 
the number of valid pages when the size of objects is 
drawn from a binomial distribution over with parame- 
ters p 0.4 and n = 32. For this example, u = 1000 
and q = 0.2. 



5 Application to Write Amplification 

In this section, we examine write amplification when 
one object requires a fixed number of pages to store 
data. We assume that the number of pages required 
to store the data associated with the object evenly di- 
vides the number of pages per block (i.e. mod{np, b) = 
0, so that an entire object is stored within a single 
block. As in previous work, data is stored in the 
fashion of a log-structured file system with greedy 
garbage collection. 

We argue that fixed sized objects with sizes that 
evenly divide the number of pages per block is 
equivalent to reducing the number of pages in a 
block. For instance, if two objects can fit in one block, 
this is equivalent to having a block with two (large 
size) pages, and objects that require one of these 
(large) pages to store the associated data. With this 
equivalence between reduced number of pages per 
block and increased object size, the formulas for write 
amplification with Trim that we previously devel- 
oped |8| should apply. However, the Trim-modified 
Agarwal and the Trim-modified Xiang models do not 
take rip into account! The derivation of the Trim- 
modified Agarwal model causes Up to drop out nat- 
urally; it is not a good candidate for modification. 
The Trim-modified Xiang model is a better candidate, 
as asymptotically removes rip in the derivation. By 
eliminating this asymptotic approximation, the write 
amplification is computed as Up/y, where 

y= Up + 

~w (t(i + -s) (i - j^f'"^'' m (i - ^)) 
^ T 7 (3) 

and W{-) is the Lambert W function ||20l, ISI, il. The 
Trim-modified Hu model does not need to be changed 
in order to take Up into account, as it already does this. 
The model can be found in Figure |4] 

Results comparing the simulated write amplifica- 
tion and the computed write amplification are in Ta- 
ble |4] and Figure |5] In this simulation, there were 1280 
blocks, with a manufacturer-specified spare factor of 
0.2, and a Trim factor of 0.1. As the number of pages 
per block changed, the effective spare factor was 
held constant to enable comparisons. As expected, 
the write amplification was reduced as the number 
of pages per block decreased, with the minimum 
possible write amplification of 1 achieved with 1 
page in a block. The modification of the Xiang model 
shown in Equation |3] was a very poor approximation 
as the number of pages in a block decreased; this 
is likely due to the assumption that the number of 
invalid pages freed in a block by garbage collection 
is binomial; with a small number of pages per block, 
this approximation becomes too coarse. However, the 
Trim-modified Hu model provides a reasonable, if 
optimistic, prediction of write amplification. Note that 
the Trim-modified Hu model does not make sense to 
compute if there is one page per block, so the entry 
in the table is left blank. 
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Computation of Trim-Modified Write Amplifica- 
tion Factor 


Compute p~ (pi , • • 


iPj 1 ' ' ' •) Pw ) 




Pi = 


1-9 (--1) 

\us / 


(4) 




(5) 


Compute V ~ {vj,k), a matrix of size w x n 
j — 1, ■ ■ ■ ,w & A; = 0, • • • , rip — 


1 


k 






Compute V = (Vi, • • 








w— 1 

= n "j.'^ 




Compute p* = (pq , ■ 


• ^Pk^-- ,PnJ 




Po = i-Vo 






Pk = Vk-i~ Vk 


for k — 1, . . . ,np - 


^ 1 


= 1 - 






Compute the write amplification. 






rip 


(6) 


"■p ~ J2k=0 ^P*k 



Fig. 4. Equations needed to compute tine Trim- 
modified empirical model-based algorithm for Ahu 
(adapted from [3], [4] and [8]). w is the window size, 
and allows for the windowed greedy garbage collection 
variation of greedy garbage collection. Setting w = 
:^ — r is needed for the standard greedy garbage 
collection discussed in this paper. For an explanation 
of the theory behind these formulas, see [3], [4J. 



When objects do not fit neatly within blocks, such as 
the case for mod{np, b) ^ 0, for variable sized objects, 
or for objects larger than one block, the computation 
of write amplification is not as straightforward. First, 
design decisions must be made: should an object be 
split over multiple blocks, or should empty, unwritten 
pages be left on a block, that perhaps could be filled 
in later by a smaller object? The design decision will 
affect the analysis of the write amplification; this is an 
area for future research. 
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TABLE 4 

Write amplification for varying number of pages per 
block, equivalent to fixed sized objects that evenly 
divide Up. The level of effective overprovisioning was 
kept constant for each simulation. The amount of Trim 
was q 0.1. All models referenced are Trim-modified; 
the Xiang model was additionally modified to account 
for Hp, as shown in Equation |3] 





Write Amplification 


Pages 

/Block 

rip 


Simulation 


Modified 

Xiang 

Model 


Hu 

Model 


1 


1.000 


1.936 




2 


1.191 


1.937 


1.000 


4 


1.432 


1.938 


1.065 


8 


1.631 


1.938 


1.274 


16 


1.771 


1.938 


1.628 


32 


1.853 


1.938 


1.732 


64 


1.896 


1.938 


1.793 


128 


1.918 


1.938 


1.828 


256 


1.929 


1.938 


1.847 



Write Amplification vs. Pages Per Blocl< 




Fig. 5. Write amplification versus pages per block 
for constant effective overprovisioning. The modified 
Xiang model (Equation |3]l fails when the number of 
pages per block is low. The Trim-modified Hu model 
provides an optimistic, but reasonable, prediction. 

6 Conclusion 

In this paper, the Trim model we previously presented 
in JZl, ||8l, ||9| was extended to encompass the idea 
of object-based storage and allow objects to utilize 
more than one page for data storage. A utilization 
model was derived and shown to be accurate via sim- 
ulation. Additionally, models for write amplification 
were modified and compared to simulated values, 
demonstrating the failure point of one model and the 
need for a more accurate model in another Models 
such as the ones presented in this paper are useful 
tools for understanding the problems that an FTL 
design needs to address, and can lead to improved 
future designs. 

The model extension presented in this paper helps 
make the workload a more characteristic of real-world 



workloads. By using this model in conjunction with 
object-based storage, we envision metadata servers 
that separate object classes of distinct sizes to unique 
flash SSDs in order to assist in self-optimization of 
hardware that would improve overall data access 
performance and improve the lifetime of the SSD. 

References 

[I] Y. Kang, J. Yang, and E. Miller, "Efficient storage management 
for object-based flash memory," in 2010 18th Annual IEEE/ACM 
International Symposium on Modeling, Analysis and Simulation of 
Computer and Telecommunication Systems. IEEE, 2010, pp. 407- 
409. 

[2] D. Feng and L. Qin, "Adaptive object placement in object- 
based storage systems with minimal blocking probability," in 
Advanced Information Networking and Applications, 2006. AINA 
2006. 20th International Conference on, vol. 1. IEEE, 2006, pp. 
611-616. 

[3] X. Hu, E. Eleftheriou, R. Haas, I. lliadis, and R. Pletka, "Write 
amplification analysis in flash-based solid state drives," in 
Proceedings of SYSTEM 2009: The Israeli Experimental Systems 
Conference. ACM, 2009. 

[4] X. Hu and R. Haas, "The fundamental limit of flash random 
write performance: Understanding, analysis and performance 
modelling," IBM Research Report, vol. RZ 3771, Mar 2010. 

[5] R. Agarwal and M. Marrow, "A closed-form expression for 
Write Amplification in NAND Flash," in GLOBECOM Work- 
shops (GC Wkshps), 2010 IEEE. IEEE, 2010, pp. 1846-1850. 

[6] L. Xiang and B. Kurkoski, "An Improved Analytical Expres- 
sion for Write Amplification in NAND Flash," in International 
Conference on Computing, Networking, and Communications, 2012, 
Also available at Arxiv preprint arXiv:1110.4245. 

[7] T. Frankie, G. Hughes, and K. Kreutz-Delgado, "A Mathe- 
matical Model of the Trim Cormnand in NAND-Flash SSDs," 
in Proceedings of the 50th Annual Southeast Regional Conference. 
ACM, 2012. 

[8] , "The Effect of Trim Requests on Write Amplification in 

Solid State Drives," in Proceedings of the lASTED-CIIT Confer- 
ence. lASTED, 2012. 

[9] , "Analysis of trim commands on overprovisioning and 

write amplification in solid state drives," ACM Transactions on 
Storage (TOS), 2012, in Submission. 

[10] L. Grupp, A. Caulfield, J. Coburn, S. Swanson, E. Yaakobi, 
P. Siegel, and J. Wolf, "Characterizing flash memory: anoma- 
lies, observations, and applications," in Proceedings of the 42nd 
Annual IEEE/ACM International Symposium on Microarchitecture. 
ACM, 2009, pp. 24-33. 

[II] D. Roberts, T. Kgil, and T. Mudge, "Integrating NAND flash 
devices onto servers," Commun. ACM, vol. 52, no. 4, pp. 98- 
103, 2009. 

[12] T. Chung, D. Park, S. Park, D. Lee, S. Lee, and H. Song, "A sur- 
vey of flash translation layer," Journal of Systems Architecture, 
vol. 55, no. 5-6, pp. 332-343, 2009. 

[13] A. Gupta, Y. Kim, and B. Urgaonkar, "DFTL: a flash translation 
layer employing demand-based selective caching of page- 
level address mappings," in Proceedings of the 14th international 
conference on Architectural support for programming languages and 
operating systems. ACM, 2009, pp. 229-240. 

[14] M. Rosenblum and J. Ousterhout, "The design and implemen- 
tation of a log-structured file system," ACM Transactions on 
Computer Systems (TOCS), vol. 10, no. 1, pp. 26-52, 1992. 

[15] Information Technology - ATA/ATAPI Command Set - 2 (ACS-2), 
INCITS Working Draft T13/2015-D Rev. 7 ed., June 2011, 
http: / / www.tl3.org/documents/UploadedDocuments / | 
docs2011/d2U15r7-AlAAlAl-'l_Command_Set_-_2_ AC!3-!?. | 
pdf 

[16] N. Agrawal, V. Prabhakaran, T. Wobber, J. Davis, M. Manasse, 
and R. Panigrahy, "Design tradeoffs for SSD performance," in 
USENIX 2008 Annual Technical Conference on Annual Technical 
Conference. USENIX Association, 2008, pp. 57-70. 

[17] Information Technology - SCSI Object-Based Storage Device 
Commands-3 ('OSD-3j,"lNCITS Working Draft T10/2128-D Rev. 
ed., March 2009, http://www.tl0.org/cgi-bin/ac.pl?t=f&^ 
|osd3r00.pdf| 



9 



[18] Y. Kang, J. Yang, and E. Miller, "Object-based SCM: An Ef- 
ficient Interface for Storage Class Memories," in Mass Storage 
Systems and Technologies (MSST), 2011 IEEE 27th Symposium on. 
IEEE, 2011, pp. 1-12. 

[19] A. Rajimwale, V. Prabhakaran, and J. Davis, "Block manage- 
ment in solid-state devices," in Proceedings of the 2009 conference 
on USENIX Annual technical conference. USENIX Association, 
2009, pp. 21-21. 

[20] R. Corless, G. Gonnet, D. Hare, D. Jeffrey, and D. Knuth, "On 
the LambertW function," Advances in Computational mathemat- 
ics, vol. 5, no. 1, pp. 329-359, 1996. 



